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(SEQ ID NO:l-nucleotide sequence and SEQ ID NO:2-amino acid sequence) 
Escherichia coli B Phytase Sequence 



1 

ATG AAA GCG ATC TTA ATC CCA TTT TTA TCT CTT CTG ATT CCG TTA ACC CCG 
Met Lys Ala lie Leu lie Pro Phe Leu Ser Leu Leu lie Pro Leu Thr Pro 

CAA TCT GCA TTC GCT CAG AGT GAG CCG GAG CTG AAG CTG GAA AGT GTG GTG 
Gin Ser Ala Phe Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val Val 

ATT GTC AGT CGT CAT GGT GTG CGT GCT CCA ACC AAG GCC ACG CAA CTG ATG 
lie Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin Leu Met 

CAG GAT GTC ACC CCA GAC GCA TGG CCA ACC TGG CCG GTA AAA CTG GGT TGG 
Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys Leu Gly Trp 

CTG ACA CCG CGN GGT GGT GAG CTA ATC GCC TAT CTC GGA CAT TAC CAA CGC 
Leu Thr Pro Arg Gly Gly Glu Leu lie Ala Tyr Leu Gly His Tyr Gin Arg 

CAG CGT CTG GTA GCC GAC GGA TTG CTG GCG AAA AAG GGC TGC CCG CAG TCT 
Gin Arg Leu Val Ala Asp Gly Leu Leu Ala Lys Lys Gly Cys Pro Gin Ser 

GGT CAG GTC GCG ATT ATT GCT GAT GTC GAC GAG CGT ACC CGT AAA ACA GGC 
Gly Gin Val Ala lie lie Ala Asp Val Asp Glu Arg Thr Arg Lys Thr Gly 

GAA GCC TTC GCC GCC GGG CTG GCA CCT GAC TGT GCA ATA ACC GTA CAT ACC 
Glu Ala Phe Ala Ala Gly Leu Ala Pro Asp Cys Ala lie Thr Val His Thr 

CAG GCA GAT ACG TCC AGT CCC GAT CCG TTA TTT AAT CCT CTA 7VAA ACT GGC 
Gin Ala Asp Thr Ser Ser Pro Asp Pro Leu Phe Asn Pro Leu Lys Thr Gly 

GTT TGC CAA CTG GAT AAC GCG TU^C GTG ACT GAC GCG ATC CTC AGC AGG GCA 
Val Cys Gin Leu Asp Asn Ala Asn Val Thr Asp Ala lie Leu Ser Arg Ala 

GGA GGG TCA ATT GCT GAC TTT ACC GGG CAT CGG CAA ACG GCG TTT CGC GAA 
Gly Gly Ser lie Ala Asp Phe Thr Gly His Arg Gin Thr Ala Phe Arg Glu 

CTG GAA CGG GTG CTT AAT TTT CCG C7^ TCA AAC TTG TGC CTT AAA CGT GAG 
Leu Glu Arg Val Leu Asn Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu 

AAA CAG GAC GAA AGC TGT TCA TTA ACG CAG GCA TTA CCA TCG GAA CTC AAG 
Lys Gin Asp Glu Ser Cys Ser Leu Thr Gin Ala Leu Pro Ser. Glu Leu Lys 

GTG AGC GCC GAC AAT- GTC TCA TTA ACC GGT GCG GTA AGC CTC GCA TCA ATG 
Val Ser Ala Asp Asn Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met 

CTG ACG GAG ATA TTT CTC CTG CAA CAA GCA CAG GGA ATG CCG GAG CCG GGG 
Leu Thr Glu lie Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly 

TGG GGA AGG ATC ACC GAT TCA CAC CAG TGG AAC ACC ^TTG CTA AGT TTG CAT 
Trp Gly Arg lie Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His 

AAC GCG CAA TTT TAT TTG CTA CAA CGC ACG CCA GAG GTT GCC CGC AGC CGC 
Asn Ala Gin Phe Tyr Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser Arg 

GCC ACC CCG TTA TTG GAT TTG ATC ATG GCA GCG TTG ACG 'CCC CAT CCA CCG 
Ala Thr Pro Leu Leu Asp Leu He Met Ala Ala Leu Thr Pro His Pro Pro 
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pHyTemperature Profile and Stability 
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Figure 3 : 
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Thermal Tolerance of E coli Expressed Proteins 
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Figure 7a 

E. coli appA (GenBank accession no. M58708 ) (SEQ ID NO:7) 

1 taaggagcag aaacaatgtg gtatttacti tggttcgtcg gcattttgtt gatgtgttcg 
61 ctctccaccc ttgtgttggt atggctggac ccgcgtctga aaagttaacg aacgtaggcc 
121 tgatgcggcg cattagcatc gcatcaggca atcaataatg tcagatatga aaagcggaaa 
181 catatcgatg aaagcgatct taatcccatt tttatctctt ctgattccgt taaccccgca 
241 atctgcattc gctcagagtg agccggagct gaagctggaa agtgtggtga ttgtcagtcg 
301 tcatggtgtg cgtgctccaa ccaaggccac gcaactgatg caggatgtca ccccagacgc 
361 atggccaacc tggccggtaa aactgggttg gctgacaccg cgnggtggtg agctaatcgc 
421 ctatctcgga cattaccaac gccagcgtct ggtagccgac ggattgctgg cgaaaaaggg 
481 ctgcccgcag tclggtcagg tcgcgattat tgctgalgtc gacgagcgta cccgtaaaac 
541 aggcgaagcc ttcgccgccg ggctggcacc tgactgtgca ataaccgtac atacccaggc 
601 agatacgtcc agtcccgatc cgttatttaa tcctctaaaa actggcgttt gccaactgga 
661 taacgcgaac gtgactgacg cgatcctcag cagggcagga gggtcaattg ctgactttac 
721 cgggcatcgg caaacggcgt ttcgcgaact ggaacgggtg cttaattttc cgcaatcaaa 
781 cttgtgcctt aaacgtgaga aacaggacga aagctgttca ttaacgcagg cattaccatc 
841 ggaactcaag gtgagcgccg acaatgtctc attaaccggt gcggtaagcc tcgcatcaat 
901 gctgacggag atatttctcc tgcaacaagc acagggaatg ccggagccgg ggtggggaag 
961 gatcaccgat tcacaccagt ggaacacctt gctaagtttg cataacgcgc aattttattt 
1021 gctacaacgc acgccagagg ttgcccgcag ccgcgccacc ccgttattag atttgatcaa 
1081 gacagcgttg acgccccatc caccgcaaaa acaggcgtat ggtgtgacat tacccacttc 
1 141 agtgctgttt atcgccggac acgatactaa tctggcaaat ctcggcggcg cactggagct 
1201 caactggacg cttcccggtc agccggataa cacgccgcca ggtggtgaac tggtgtttga 
1261 acgctggcgt cggclaagcg ataacagcca gtggattcag gtttcgctgg tcttccagac 
1321 tttacagcag atgcgtgata aaacgccgct gtcattaaat acgccgcccg gagaggtgaa 
1381 actgaccctg gcaggatgtg aagagcgaaa tgcgcagggc atgtgttcgt tggcaggttt 
1441 tacgcaaatc gtgaatgaag cacgcatacc ggcgtgcagt ttgtaatgca taaaaaagag 
1 501 cattcagtta cctgaatgct ctgaggctga tgacaaacga agaactgtct aatgcgtaga 
1 561 ccggaaaagg cgttcacgcc gcatccggcc actttcagtt ttcctctttc tcggagtaac 
1621 tataaccgta atagttatag ccgtaactgt aagcggtgct ggcgcgttta atcacaccat 
1681 tgaggatagc gcctttaata ttgacgcctg cctgttccag acgctgcatt gacaaactca 
1741 cctctttggc ggtgttcaag ccaaaacgcg caaccagcag gctggtgcca acagaacgcc 
1801 ccacgaccgc ggcatcactc accgccagca tcggcggcgt atcgacaatc accagatcgt 
1861 aatggtcgtt cgcccattcc agtaattgac gcatccgatc g 
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1 taaggagcag aaacaatgtg gtatttactt tggttcgtcg gcattttgtt gatgtgttcg 

61 ctctccaccc ttgtgttggt atggctggac ccgcgtctga aaagttaacg aacgtaggcc 
121 igatgcggcg cattagcatc gcatcaggca atcaataatg tcagatatga aaagcggaaa 
181 catatcgatg aaagcgatct taatcccatt tttatctctt ctgattccgt taaccccgca 
241 atctgcattc gctcagagtg agccggagct gaagctggaa agtgtggtga ttgtcagtcg 
301 tcatggtgtg cgtgctccaa ccaaggccac gcaactgatg caggatgtca ccccagacgc 
361 atggccaacc tggccggtaa aactgggttg gctgacaccg cgngglggtg agctaatcgc 
421 ctatctcgga cattaccaac gccagcgtct gglagccgac ggattgctgg cgaaaaaggg 
481 ctgcccgcag tctggtcagg tcgcgattat tgctgatgtc gacgagcgta cccgtaaaac 
541 aggcgaagcc ttcgccgccg ggctggcacc tgactgtgca ataaccgtac atacccaggc 
601 agatacgtcc agtcccgatc cgttatttaa tcctctaaaa actggcgttt gccaactgga 
661 taacgcgaac gtgactgacg cgatcctcag cagggcagga gggtcaattg ctgactttac 
721 cgggcatcgg caaacggcgt ttcgcgaact ggaacgggtg cttaattttc cgcaatcaaa 
781 cttgtgcctt aaacgtgaga aacaggacga aagctgttca ttaacgcagg cattaccatc 
841 ggaactcaag gtgagcgccg acaatgtctc attaaccggt gcggtaagcc tcgcatcaat 
901 gctgacggag atattlctcc tgcaacaagc acagggaatg ccggagccgg ggtggggaag 
961 gatcaccgat tcacaccagt ggaacacctt gctaagtttg cataacgcgc aattttattt 
1021 gclacaacgc acgccagagg ttgcccgcag ccgcgccacc ccgttattag atttgatcaa 
1081 gacagcgttg acgccccatc caccgcaaaa acaggcgtat ggtgtgacat tacccacttc 
1141 agtgctgttt atcgccggac acgatactaa tctggcaaat ctcggcggcg cactggagct 
1 201 caactggacg cttcccggtc agccggataa cacgccgcca ggtggtgaac tggtgtttga 
1 261 acgctggcgt cggclaagcg ataacagcca gtggattcag gtttcgctgg tcttccagac 
1 321 tttacagcag atgcgtgata aaacgccgct gtcatlaaat acgccgcccg gagaggtgaa 
1381 actgaccctg gcaggatgtg aagagcgaaa tgcgcagggc atgtgttcgt tggcaggttt 
1441 tacgcaaatc gtgaatgaag cacgcatacc ggcglgcagt ttgtaatgca taaaaaagag 
1 501 cattcagtta cctgaatgct ctgaggctga tgacaaacga agaactgtct aatgcgtaga 
1 561 ccggaaaagg cgttcacgcc gcatccggcc actttcagtl ttcctctttc tcggagtaac 
1 621 tataaccgta atagttatag ccgtaactgt aagcggtgct ggcgcgttta atcacaccat 
1681 tgaggatagc gcctttaata ttgacgcctg cctgttccag acgctgcatt gacaaactca 
1741 cctctttggc ggtgttcaag ccaaaacgcg caaccagcag gctggtgcca acagaacgcc 
1 801 ccacgaccgc ggcatcactc accgccagca tcggcggcgt atcgacaatc accagatcgt 
1861 aatggtcgtt cgcccattcc agtaattgac gcatccgatc g 
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Amino acid sequence for E. coli appA (wild type) (SEQ ID NO:8) 

MKAILIPFLSLLIPLTPQSAFAQSEPELKLESVVIVSRHGVRAPTKATQtMQDVT 

PDAWPTWPVKLGWLTPRGGELIAYLGHY<2RQRLVADGLLAKKGCPQSGQVA 

IIADVDERTRKTGEAFAAGLAPDCAITVHTQADTSSPDPLFNPLKTGVCQLDNA 

NVTDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSNLCLKREKQDESCSLTQ 

ALPSELKVSADNVSLTGAVSLASMLTEIFLLQQAQGMPEPGWGRITDSHQWNT 

LLSLHNAQFYLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFI 

AGHDTNLANLGGALELNWTLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSL 

WQTLQQMRDKTPLSLNTPPGEVKLTLAGCEERNAQGMCSLAGFTQIVNEARI 

PACSL 

Bold-Underlined amino acid residues are shown below in the modified appA enzyme 
(SEQIDNO:10) 

MKAILIPFLSLLIPLTPQSAFAQSEPELKLESVVIVSRHGVRAPTKATQLMQDVT 

PDAWPTWPVKLGELTPRGGELIAYLGHYWRQRLVADGLLPKCGCPQSGQVAI 

lADVDERTRKTGEAFAAGLAPDCAITVHTQADTSSPDPLFNPLKTGVCQLDNA 

NVTDAILERAGGSIADFTGHYQTAFRELERVLNFPQSNLCLKREKQDESCSLTQ 

ALPSELKVSADCVSLTGAVSLASMLTEIFLLQQAQGMPEPGWGRITDSHQWNT 

LLSLHNAQFDLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVLFI 

AGHDTNLANLGGALELNWTLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSL 

WQTLQQMRDKTPLSLNTPPGEVKLTLAGCEERNAQGMCSLAGFTQIVNEARI 

PACSL 




